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Abstract. Perhaps the two most significant theoretical questions about the programming of self- 
assembling agents are: (1) necessary and sufficient conditions to produce a unique terminal assembly, 
and (2) error correction. We address both questions, by reducing two well-studied models of tile assembly 
to models of distributed shared memory (DSM), in order to obtain results from the memory consistency 
systems induced by tile assembly systems when simulated in the DSM setting. The Abstract Tile 
Assembly Model (aTAM) can be simulated by a DSM system that obeys causal consistency, and the 
locally deterministic tile assembly systems in the aTAM correspond exactly to the concurrent-write 
free programs that simulate tile assembly in such a model. Thus, the detection of the failure of local 
determinism (which had formerly been an open problem) reduces to the detection of data races in 
simulating programs. Further, the Kinetic Tile Assembly Model can be simulated by a DSM system that 
obeys GWO, a memory consistency condition defined by Steinke and Nutt. (To our knowledge, this is 
the first natural example of a DSM system that obeys GWO, but no stronger consistency condition.) We 
combine these results with the observation that self-assembly algorithms are local algorithms, and there 
exists a fast conversion of deterministic local algorithms into deterministic self-stabilizing algorithms. 
This provides an "immediate" generalization of a theorem by Soloveichik et al. about the existence of 
tile assembly systems that simultaneously perform two forms of self-stabilization: proofreading and self- 
healing. Our reductions and proof techniques can be extended to the programming of self-assembling 
agents in a variety of media, not just DNA tiles, and not just two-dimensional surfaces. 
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1 Introduction 



In August 2009, IBM and the DNA and Natural Algorithms Group at Caltech announced a joint project 
to use "DNA origami" as a scaffolding in order to place microchip components 6 nm apart, breaking the 
22 nm barrier that is the current state of the art in chip manufacturing [1]. In contrast to some other 
emerging models of computation, such as quantum computing or membrane computing, about which there 
are extensive theoretical results but as yet little experimental success, the experimental results of algorithmic 
DNA self-assembly are significantly ahead of the theory. (A recent survey of nanofabrication by DNA self- 
assembly, including a high-level explanation of DNA origami, appears in [5].) Indeed, there are only a handful 
of results about perhaps the two most significant theoretical questions about self-assembly programming: 
(1) necessary and sufficient conditions to produce a unique terminal assembly, and (2) error correction. 
The goal of this paper is to recast those two questions as programming questions of memory consistency 
conditions and self-stabilizing algorithms, thus making techniques from the study of concurrent architectures 
and programming languages, and self-stabilization, available to this emerging area of research. 

When a global structure (or organism) forms because of the connections formed by strictly simpler 
structures to one another, following only local rules, we say the global structure self-assembles. The goal of 
algorithmic self-assembly is to direct (or to program) the self-assembly of desired structures, by constructing 
self-assembling agents, and their environment, so they combine to form a desired result. We will focus on 
algorithmic DNA self-assembly in this paper, a field merging computer science and nanotechnology that began 
in the 1990s, spurred especially by the work of Adleman, Rothemund, and Winfree [3j. The formalisms to 
model self-assembling systems contain the following: a finite set of distinct types of self-assembling agents, a 
set of local binding rules that completely determines the behavior of the agents, and an initial configuration 
of the system. A particular self-assembly "run" starts with an operator placing a finite seed assembly on the 
surface, and then allowing a "solution" containing infintely many of each agent type to mix on the surface. 
Agents bind to the seed assembly, and to the growing configuration, consistent with the local rules, and 
in a random, asynchronous manner. In the tile assembly models we consider in this paper, each agent is a 
four-sided tile, and the assembly surface is the two-dimensional integer plane. 

In [1], we proved a time lower bound for certain computational problems in self-assembly on a surface, 
by reducing a class of self-assembly models to message-passing models of distributed computing, and then 
applying a known impossibility result about local algorithms. We follow a similar strategy in this paper: we 
reduce well-studied tile assembly models to models of distributed shared memory and then explore some of 
the consequences of those reductions. In particular, we consider the Abstract Tile Assembly Model (aTAM), 
due to Winfree [5] and Rothemund [6] , and the Kinetic Tile Assembly Model (kTAM) due to Winfree |5| . In 
the aTAM, binding between self-assembling agents is error-free and irreversible; while in the kTAM, binding 
errors are possible, and agents can bind but later dissociate with some positive probability. We show that tile 
assembly systems defined in the aTAM can be simulated by systems of distributed shared memory (DSM) 
that obey causal consistency [7], a memory consistency condition weaker than the better- known sequential 
consistency. In a sense, this level of memory consistency is "tight" for any DSM model that simulates the 
aTAM. 

Next, we translate one of the fundamental theorems about the aTAM — that "locally deterministic" tile 
assembly systems produce a unique terminal assembly ^ — into the language of memory consistency. We show 
that locally deterministic tile assembly systems correspond exactly to the concurrent-write free programs 
that simulate tile assembly in our DSM model. Hence, the programming techniques to produce data-race 
free and concurrent-write free programs — and to detect data races — can be applied to the programming of 
self-assembling agents. 

Regarding the kTAM, we show it reduces to a model of DSM obeying memory consistency condition 
GWO, which is strictly weaker than causal consistency. Again, there is a sense in which this level of memory 
consistency is tight. GWO was defined by Steinke and Nutt, to fill out a lattice with which they compared 
all known memory consistency conditions f9]. To our knowledge, the only DSM system in the literature that 
precisely obeys GWO is the one Steinke and Nutt built to show that some such model exists. The DSM 
simulation of the kTAM, then, is the first natural example of a model that lies within GWO but no stronger 
level of memory consistency. 



1 



Finally, we combine these results with the observation that self-assembly algorithms are local algorithms, 
and there exists a fast conversion of deterministic local algorithms into deterministic self-stabilizing algo- 
rithms. This provides an "immediate" generalization of a theorem by Soloveichik et al. about the existence 
of tile assembly systems that simultaneously perform two forms of self-stabilization: proofreading and self- 
healing. Our general reduction and proof techniques can be extended to the programming of self-assembling 
agents in a variety of media, not just DNA tiles, and not just two-dimensional surfaces. 

Several researchers have voiced intuitions about a connection between self-assembly and distributed 
computing, for example Klavins [TO], or an Arora et al. 2007 NSF Report [TT|. However, the first rigorous 
application of the theory of distributed computing to questions in self-assembly appeared in [4]. Subsequent 
to ID (and its extended version [H]), we used the wait-free consensus hierarchy to separate models of 
self-assembly based on their synchronization power |13j : and we showed that graph assembly systems (a 
graph grammar self-assembly formalism due to Klavins) are distributed systems in a strong sense [Tl]. To 
the best of our knowledge, the current paper is the first to consider self-assembly within the context of 
memory consistency models, and the first to apply multiprocessor programming techniques to biomolecular 
computation 

The rest of the paper is structured as follows. Section [2] provides background on tile assembly models 



we reduce the Abstract Tile Assembly Model to a DSM 
3.2 we reduce the Kinetic Tile Assembly Model to a DSM 



and memory consistency models. In Section |3.1 
system that obeys causal consistency. In Section 
system that obeys GWO. In Section |4] we show how to generalize an existence theorem about proofreading 
and self-healing tilesets by using techniques from self-stabilizing algorithms. Section [5] concludes the paper 
and suggests directions for future research. 



2 Background 

2.1 Tile assembly background 

We now give the formal definitions of the tile assembly models we will work with. 



Abstract Tile Assembly Model Winfree's objective in defining the Abstract Tile Assembly Model was 
to provide a useful mathematical abstraction of DNA tiles combining in solution in a nondetcrministic, 
asynchronous manner 5, . Rothemund |6j , and Rothemund and Winfree [15, , extended the original definition 
of the model. For a comprehensive introduction to tile assembly, we refer the reader to |^. Intuitively, we 
desire a formalism that models the placement of square tiles on the integer plane, one at a time, such that 
each new tile placed binds to the tiles already there, according to specific rules. Tiles have four sides (often 
referred to as north, south, east and west) and exactly one orientation, i.e., they cannot be rotated. 

A tile assembly system T is a 5-tuple (T, cr, r, R), where T is a finite set of tile types; a is the seed tile 
or seed assembly, the "starting configuration" for assemblies of T; r : T x {iV, E, W} ^ x {0, 1, 2} is an 
assignment of symbols ("glue names") and a "glue strength" (0, 1, or 2) to the north, south, east and west 
sides of each tile; and a symmetric relation R C S x S that specifics which glues can bind with nonzero 
strength. In this model, there are no negative glue strengths, i.e., two tiles cannot repel each other. 

A configuration ofT is a set of tiles, all of which are tile types from T, that have been placed in the plane, 
and the configuration is stable if the binding strength (from t and R in T) at every possible cut is at least 
2. An assembly sequence is a sequence of single-tile additions to the frontier of the assembly constructed at 
the previous stage. Assembly sequences can be finite or infinite in length. The result of assembly sequence 
a is the union of the tile configurations obtained at every finite stage of a . The assemblies produced by T is 
the set of all stable assemblies that can be built by starting from the seed assembly of T and legally adding 
tiles. If a and /? are configurations of T, we write a — > (3 if there is an assembly sequence that starts at a 
and produces [3. An assembly of T is terminal if no tiles can be stably added to it. 

We are, of course, interested in being able to prove that a certain tile assembly system always achieves a 
certain output. In [8J, Soloveichik and Winfree presented a strong technique for this: local determinism. An 
assembly sequence ~a is locally deterministic if (1) each tile added in a binds with the minimum strength 
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required for binding; (2) if there is a tile of type to at location I in the result of a, and to and the immediate 
"OUT-neighbors" of to are deleted from the result of ~a, then no other tile type in T can legally bind at 
the result of a is terminal. Local determinism is important because of the following result. 

Theorem 1 (Soloveichik and Winfree |8]). IfT is locally deterministic, then T has a unique terminal 
assembly. 

Kinetic Tile Assembly Model The Kinetic Tile Assembly Model (kTAM) was defined by Winfree [S], 
to provide a mathematical model for self-assembly (and disassembly) in solution, based on the kinetics of 
chemical reactions. Slightly different versions of the kTAM appear in different papers on the subject. We 
will follow the treatment in |16j . because we will use techniques from distributed computing to address an 
open question in [16] . 

Whereas the aTAM is an error-free, irreversible, nondeterministic model, the kTAM is a probabilistic 
model in which tiles bind with some probability of error, and bound tiles can dissociate with some probability. 
These probabilities are derived from the equations of chemical reaction kinetics. There is a forward rate /, 
which we assume is the same for any tile type at any position of the perimeter in the growing assembly, 
defined as / = fc/e~'^""=, where kf is a constant that sets the time scale, and Gmc is the logarithm of 
the concentration of each tile type in solution. We assume that tiles can only fall off of the perimeter of 
the assembly; this assumption matches experimental observation. The rate of dissociation (reverse rate ri,) 
depends exponentially on the number of bonds that must be broken: rt = kfe~^'^"', where h is the total 
interaction strength with which the tile is attached to the assembly, and Gse is the unit bond free energy, 
which may depend on the overall temperature of the system. 

As with our treatment of the aTAM, we assume that "strength 2" bonds are sufficient for tiles in the 
kTAM to bind stably. Hence we let f — r2, which ensures that the forward growth of the kTAM mirrors 
(with high probability) the binding rules of the aTAM, and incorrectly bound tiles (with high probability) 
quickly dissociate. We assume that fc/ is a physical constant that cannot be experimentally controlled, but 
by changing concentrations or temperature we could change Gmc and/or Gse- 

One objective of this paper is to develop proof techniques that would be applicable to models of self- 
assembly other than the aTAM and kTAM. To this end, we will not use all the information available about 
the kTAM when we simulate it, but rather will use a more general construction, with the understanding that 
specific values for probabilities could be plugged in as required, based on the rate equations of the kTAM. 
In particular, we will limit ourselves to the existence of a forward rate / which is the same for each bond, a 
reverse rate which is much higher for erroneous bonds than for correct ones, and an error probability tTe 
that a tile will bind incorrectly to the frontier of the assembly. 

2.2 Distributed shared memory background 

A distributed shared memory model (or system) is a model of distributed processors and (possibly shared) 
read/write registers. A processor p can perform a read or a write on register r. Hp has permission to perform 
that operation on r. The only operations a processor can perform on a register are reads and writes. The 
read operation begins with an invocation and terminates when p receives a value. The write operation begins 
with an invocation that includes a value, and ends when p receives an ack. A processor can only perform 
one operation at a time. We do not assume atomicity of reads and writes to a given register r. 

A memory consistency model specifies the allowable behavior of memory. Study of memory consistency 
models arose from a conflict between the goals of hardware and compiler designers, to permit aggressive 
optimization (which requires "weak" memory consistency) , and the desire of programmers to have concurrent 
code execute in a predictable fashion (which requires "strong" memory consistency). We refer the reader 
to [17] for a survey and tutorial on these issues. In 2004, Steinke and Nutt presented a theory that unified the 
various memory consistency models that had been proposed in the literature [9 . They showed the existence 
of a lattice of 13 memory consistency models; this lattice contained all known models, and showed the logical 
interrelation between each. We now describe the memory consistency conditions that will be most important 
for this paper. 



3 



A system of distributed processors is sequentially consistent [TH] if the result of any execution is the same 
as if the operations of all the processors were executed in some sequential order, and the operations of each 
individual processor appear in this sequence in the order specified by its program. A system of distributed 
processors is causally consistent |7j if for each processor the operations of that processor plus all writes known 
to that processor appear to that processor to appear in a total order that respects potential causality. A 
system of distributed processors is PRAM consistent |19j if writes performed by a single processor are seen 
by all other processors in the order in which they were issued, but writes from different processors may be 
seen in different orders by different processors. These three consistency conditions are in descending order of 
strength: a sequentially consistent system is causally consistent, and a causally consistent system is PRAM 
consistent, but the converses do not always hold. 

To compare consistency conditions, Steinke and Nutt defined logical properties about processor views of 
a DSM system. A processor view is a total order on a subset of operations that occurred during an execution 
of a DSM system, the subset being those operations an individual processor performed or could deduce 
occurred. The property GPO ("global process order") is the condition that there is global agreement on the 
order of operations from each process. The property GWO ("global write-read- write order") is the condition 
that there is global agreement on the order of any two writes when a process can prove it has read one before 
the other. A DSM system satisfies GPO exactly when it is PRAM consistent, and it satisfies both GPO and 
GWO exactly when it is causally consistent. 

A relation is a causality order of operations if oi -< 02 means that one of the following holds for any 
operations oi and 02: 

1. oi and 02 were performed by the same processor p, and p executed oi before 02. 

2. 02 reads the value written to a shared register by oi. 

3. There is some other operation o' such that oi ^ o' 02. 

We will say that program P is concurrent-write free in DSM model A4 if every processor in A4 runs P, and 
there is no legal execution history H of P such that the causality order induced by H contains two writes 
wi and W2 such that wi 7^ W2 and W2 7^ wi {i.e., there are no writes that are causally concurrent under 
any possible execution). In other words, P is concurrent- write free if a single program Q that simulates each 
processor in the system executing P is concurrent-write free under the standard definition that no execution 
of Q contains conflicting writes. 

3 Reduction of tile assembly models to distributed shared memory models 
3.1 The aTAM reduces to causally consistent models of DSM 

The objective of this section is to show that the aTAM can be simulated by a causally consistent DSM system, 
and, under a reasonable definition of "simulation," no DSM system that fails to obey causal consistency can 
simulate the aTAM. First, we define formally what it means for a DSM system to simulate a tile assembly 
system. For simplicity, we limit consideration to tile assembly systems that self-assemble on the first quadrant 
of Z^; our definitions could be extended to the entire integer plane. 

Definition 1. Let T = (T, a, Z", r, i?) he a tile assembly system in the aTAM, and M be a model of distributed 
processors. We say AI simulates T on the k x k surface if the following holds. 

1. The network topology of M is that of a k x k square. 

2. Each processor pij (0 <i,j< k) in M has a write-once output register pij that is initialized to value 
EMPTY, and can take on \T\ distinct values other than EMPTY. 

3. There is a bijection tp from tile types in T to possible values of p (not including EMPTY). 

4-. For each legal tile assembly sequence a of T , there is a legal execution E of M such that, if tile type t 
is placed on coordinate {i,j) in a , then processor p^j writes ipit) to pij in E. Moreover, processors write 
to their respective p in E in the same order that tiles get placed on the surface in a . (The placement of 
the seed assembly a is simulated by writing the value ^^{t) to ptj if a{i,j) = t.) 
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5. For each legal execution E of M , there is a corresponding legal assembly sequence a . such that if pij 
writes ip(t) to pij, then tile type t is placed on location {i,j) in a . Moreover, the order of writing values 
in E is preserved by the order of placing tiles in "a . 

Intuitively, M simulates T on a finite surface if each process in M behaves Uke a location on the surface, with 
each processor executing a local algorithm that mimics the binding rules required by R. We now generalize 
the above definition to arbitrary tile assembly systems. 

Definition 2. Let T be a tile assembly system in the aTAM. We say a class M. = {Mq, Mi, . . .} of DSM 
models simulates T if, for each A; e N, Mk simulates T on a k x k surface. Let 4> be a mapping from tile 
assembly systems to algorithms. Then we say that {AA, (f)) simulates the aTAM if, for any tile assembly system 
T, M. simulates T when the processors in the models in M. run <j){T) as their local algorithm, beginning at 
an initial state determined by the seed assembly a as above. 

We now prove that there exists a class of causally consistent DSM models that simulates the aTAM. 

Theorem 2. There exists a class of DSM models A4 that simulates the aTAM. Each M G A4 is causally 
consistent. Further, the models in A4 do not obey a memory consistency condition in Steinke and Nutt's 

lattice that is stronger than causal consistency. 

Proof. Fix A; e N. We define a DSM model Mk as follows. M contains k^ processors, with network graph of 
a, k X k grid. We will refer to the processors as pij < k), to denote the processor at location in the 
network grid. Note this is a convenience for the proof; the processors do not have unique ID's, and do not 
know whether they are on the edge, the corner, or the interior of the grid. We assume that all tile assembly 
systems have temperature 2, as that is sufficient for Turing universality. Each processor can read from two 
registers, rjj and rfj. Each processor can write to twelve registers, r^, and Index of each of its neighbors 
to the north, south, east and west on the grid. (For processors on the edge of the grid, these registers exist, 
and ack when written to, but no processor ever reads from them if they "belong" to nonexistent processors. 
This way, a processor cannot deduce that it is on the edge of the grid.) To simulate r > 2, we could use 
registers , . . . , instead of just two such registers per processor. 

The register Indexij is initialized to the value 1, and can take 3 possible values: 1, 2, and "1 and 2." The 
algorithm each processor runs will look first at the value in Index to determine whether to write to ri, r2 
or both. 

Each processor pij has a write-once register pij initialized to the value EMPTY. Only pij can write to 
Pij, and it can write one of \T\ distinct values. Fix a bijection ip between tile types of T and possible values 
that can be written to each p. The processors of Mk all run a common nondeterministic, distributed, local, 
algorithm as follows. Before starting execution, Mk is configured to simulate the seed assembly of T: for 
i,j < k, if a{i,j) ^ then pij writes i^{a{i,j)) to pij. 

Once placement of the seed assembly is simulated, execution of the algorithm proceeds in s}mchronized 
stages (rounds), beginning at stage s = 0. At the start of time stage s, if at stage s — 1, Pij wrote a value 
'tp{t) (for t €T) to Pij, then pij writes to each of its neighbors in a way that communicates the glues and glue 
strengths of the tile Pij is simulating, as follows. (We limit discussion to communication with the neighbor 
to the north; communication to the other neighbors is similar.) 

First, Pij reads Indexij^i. If pij wants to communicate a bond with strength 2 to its northern neighbor, 
then it writes its message in both rlj_^_-^ and rfj_^_i. If it wants to communicate a bond with strength 1, then 
Pij writes (S,,^) to to or to both registers, based on the value it read from IndeXij+i; here g is 

a message that corresponds to the glue type on the north side of t. This iudiciates to the northern neighbor 
of Pij that glue type g is present immediately to the south. More generally, each message written to or 
of a given processor will be of form {d,g) where d € {N, S, E, W} is a direction, and g is a, glue type. 

After processors write, the second phase of stage s takes place as follows. The algorithm of exactly one 
processor pij (chosen nondeterministically from the set of processors that could legally write a value to their 
respective p using the protocol explained below) with pij still set to EMPTY chooses (again nondeterminis- 
tically) which message it will "hear" in each of rjj and rfj, of all the messages that have been written to rjj 
and rfj since the start of execution of the algorithm. Processor pij then writes a value to pij, by applying tp 
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and '4'~^ to the binding rules induced by the relation R of T, so pij writes tp{t) for some t such that t can 
legally bind given neighbors with glue types as indicated by the values of rjj and rf^ . If more that one tile 
type can legally bind given the same set of neighbors, the value ipit) is again chosen nondeterministically 
from the set of legal values. 

Finally, pij "increments" the value of Index of each of its neighbors that it wrote to. Continuing with 
our example of writing to the northern neighbor, if the value is 1, it writes the value 2 to Indexij+i. If the 
value is 2, it writes "1 and 2" to Indexij+i. This concludes stage s of the algorithm. 

Mfc simulates T on a, k x k surface in the "natural" way. For each nondeterministic run of T, i.e., each 
tile assembly sequence a , there is a nondeterministic execution of such that the behavior of each pij 
mimics the behavior of the location in ~a. Similarly, for each nondeterministic execution E of M^, there 
is a legal tile assembly sequence that makes the same nondeterministic choices, since the choices of E are 
constrained by the binding rules that determine the legal behavior of any assembly sequence of T. 

Since our choice of k was arbitrary, we can define A4 ~ {M^ \ k e N}. Such an A4 simulates T. Further, 
our definition of each M^. was uniform with respect to any set of binding rules determined by a given R. So 
we can define a class of models A4' such that each M'f, runs an algorithm that simulates a and R for any tile 
assembly system. Hence there is a class of models A4' that simulates the aTAM. 

We turn now to the memory consistency conditions obeyed by the M^. First, each must be causally 
consistent, because each processor writes only once to any memory location, and writes deterministically 
based on values read from other writes (values written either to and r^), or at stage to simulate the 
seed assembly, which is a finite, completely determined set of decisions. Therefore, for any process pij, the 
operations of that process, and any writes known to that process, occur in a total order that respects potential 
causality, even though the values of those writes were nondeterministically chosen. 

Causal consistency is an upper bound for the memory consistency of the as well. This is because 
the only memory consistency conditions that are stronger than causal consistency in the lattice by Steinke 
and Nutt are conditions that include the property GDO (Global Data Order, which is equivalent to cache 
consistency). But makes no guarantee that all writes to the same memory location are performed in 
some sequential order: a write to or that occurs at an earlier stage than another write may still be 
nondeterministically chosen as the value of that register in a legal execution. So causal consistency exactly 
captures the memory consistency of this simulation of the aTAM. 

It is worth noting that if a DSM model is going to simulate the behavior of a tile assembly system in the 
aTAM, then causal consistency is the weakest memory consistency model it can follow. This is so because 
causal consistency is the combination of properties GPO and GWO. If a DSM model does not satisfy GPO 
{i.e., does not satisfy PRAM consistency) , then consider processor pij that writes multiple times to a neighbor 
q, yet q does not see these writes in the order in which they were issued. There is no legal tile assembly 
sequence that corresponds to such an execution, as tiles in the aTAM are placed one at a time, error-free, 
based on information transmitted to neighboring locations by already-placed tiles. Similarly, if a DSM model 
does not satisfy GWO, there exists some execution where qi and q2 disagree on the ordering of writes, even 
though qi can prove that a particular write happened first. As before, no legal tile assembly sequence captures 
this behavior, as a tile assembly sequence induces a total order on the system, such that at each stage of 
assembly, a newly placed tile communicates its glue types to all neighboring locations. So with respect to 
the lattice defined by Steinke and Nutt, causal consistency is "tight" for simulations of the aTAM. 

We now show that, under our reduction, locally deterministic tile assembly systems correspond to an 
important class of simulating programs. Let P be a program that simulates a tile assembly system; we will 
call P binding-rule determined if for each set of messages that simulates strength 2 bonds, there is at most 
one value P writes to p to simulate a tile type. It is easy to check whether P is binding-rule determined, but 
harder to check whether a tile assembly system is locally deterministic, as we discuss below. 

Theorem 3. Let Ai be the class of DSM systems in Theorem^ Then ifT is a locally determinstic assembly 
system, the program P for which {Ai,P) simulates T is concurrent-write free. Conversely, if M running 
concurrent-write free, binding-rule determined, program P , simulates tile assembly system T , then T is locally 
deterministic. 
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Proof. Let T be a locally deterministic tile assembly system, let a be a legal assembly sequence of T, 
and let (x, y) be a location in the result of a . As before, we assume we are operating within the aTAM 
at temperature 2. Either {x,y) is part of the seed assembly, or, in the sequence ~a, prior to a tile being 
placed at {x,y), either one neighbor of {x,y) has a tile placed with a strength 2 bond incident to {x,y), 
or two neighbors have tiles placed with strength 1 bonds incident to {x,y). The simulation of a in via 
the reduction of Theorem [2] produces a concurrent-write free program, as for processor p^y, each of rl.y and 
r'^y is written to at most once; and, if two neighbors of Pxy write to IndeXxy with writes wi and W2, in all 
legal execution histories, there will be an operation of one processor reading the value written by the other 
processor to Index in between wi and W2, so either wi -< W2 or W2 ^ wi, because of the definition of causal 
ordering. 

Conversely, suppose {M,P) simulates a tile assembly system T, and P is concurrent- write free. Then 
there is no execution history for {A4, P) that contains concurrent writes on rjj and rfj for any pij. But this 
means that at most two neighbors could have written to the registers of pjk, as otherwise two neighbors qi 
and q2 would have written to rjj , and causal consistency permits histories in which those writes could happen 
in either order. So the writes are concurrent after all, contrary to assumption. So at most two neighbors 
write to any location before that location decides which tile type to simulate. Further, suppose one of the 
neighbors of pij that writes to the registers of pij before pij decides, writes to pij with a strength 2 bond. 
Then that neighbor writes to both rj^ and rfj , so no other neighbor can write to the registers of pij before pij 
decides, or there will be concurrent writes, by the above argument. Hence the neighbors writing to pij write 
messages that simulate exactly a strength 2 bond. Finally, since P is binding-rule determined, the conditions 
of local determinism are satisfied, and T must be locally deterministic. 

"T is locally deterministic" is an undecidable property, as the standard tile assembly Turing machine simula- 
tion is locally deterministic, and it could be modified to do something not locally deterministic iff a machine 
achieves a halting state. Nevertheless, it would be useful to test for that property when programming — and 
debugging — a tile assembly system, hence self-assembly simulation and programming tools have attempted 
to include that functionality [2D] [51] . Theorem [S] classifies this problem precisely, and indicates that pro- 
gramming techniques to ensure [22! , and detect [23,, data-race freedom and concurrent- write freedom in 
parallel systems can be used productively to program self-assembling agents. 

3.2 The kTAM reduces to GWO-consistent models of DSM 

Whereas the aTAM was a nondeterministic model, the kTAM is a stochastic model. In order to apply a 
result about self-stabilizing algorithms to the kTAM, we will construct a DSM simulation whose processors 
run deterministic algorithms and whose registers return values probabilistically, for example by providing 
erroneous information with some nonzero probability. We use the same definition as above for what it means 
for a DSM system to simulate a tile assembly system, except substituting "kTAM" for "aTAM" when it 
appears, and the pij can be written to multiple times, not just once, since the kTAM is a reversible model 
in which binding errors can occur. It turns out that the probabilistic behavior of the registers is captured by 
the memory consistency condition GWO. 

Theorem 4. There exists a class of DSM models M. that simulates the kTAM. Each Mk G M obeys GWO. 
Further, the models in M do not obey a memory consistency condition in Steinke and NutVs lattuce that is 
stronger than GWO. 

Proof. We use a DSM system that is largely the same as the one used in Theorem [2] each processor has an 
Index register, registers r^, r^, and p. Index and p when read always returns the most recent value written 
to them, but r^ and only satisfy a weaker consistency condition, as follows. 

Let V = {EMPTY, wi, 112, . . . be the set of values that has been written to r^ (the behavior of r^ 
will follow this same condition also). Let TTg be the probability of a tile binding error in the kTAM. Then, 
when r^ is read, with probability tt^ it returns a value that simulates a glue in the tile assembly system, but 
is not a value that had been previously written to r^. With probability 1 — TTg, returns one of the values 
previously written to it. This value is determined by selecting at random from the sample space V , with 
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each of the Vi weighted by /. Lastly, if in a previous stage, returned an erroneous value, at subsequent 
stages, with probability rt, when read will return the value EMPTY; this simulates the dissociation of a 
bond. After returning the value EMPTY, no longer returns EMPTY with probability Vb, unless at some 
future stage it returns an erroneous value again. 

Execution of the algorithm proceeds in synchronous rounds, much as before. This time, though, instead 
of one processor nondeterministically choosing to be the location to act this round, all processors on the 
perimeter of the simulated assembly act if they can. (Recall that the kTAM assumes, consistent with ex- 
perimental observation, that addition and dissociation of tiles only occurs on the perimeter; we mimic that 
assumption here.) The specific algorithm is as follows. 

At stage 0, each pij that is part of the seed assembly writes the appropriate value to pij. All other p, and 
all and are initialized to EMPTY. All values of Index are set to 1. 

At stage s > 1, all pij with non-EMPTY value in pij write glues to their neighbors following the same 
method as in the aTAM simulation (i.e., reading from Index and using that value to determine which subset 
of {r^,r^} to write to). Then, each processor on the perimeter of the assembly reads the contents of its 
and r^; those registers return values probabilistically, as explained above. Each processor on the perimeter 
then writes to p, if appropriate (based on the binding rules of T), perhaps to EMPTY. To conclude the 
round, processors that sent messages at the beginning of the round increment the value of the appropriate 
Index registers, similar to the aTAM simulation. 

The behavior of each pij mirrors the behavior of the locations of the surface on which T is assembling. 
So for any legal execution of M, there is a legal tile assembly sequence in which the tiles were placed in the 
same order. Note that there is a significant change from the previous simulation: multiple processors may 
act in a round, instead of just one at a time, as in the aTAM simulation, so multiple locations may write to 
the same . That is consistent with our definition of "simulation," because for every tile assembly sequence 
of T, there will be an execution of A4 that writes to each p in the same order that locations add or remove 
tiles. The order is what matters, not the exact time step at which a change takes place. So A4 simulates the 
kTAM. 

We now show that A4 obeys GWO, but no stronger consistency model in the lattice of Steinke and 
Nutt. Recall that GWO means that there is global agreement on the order of potentially causally- related 
writes. The writes of processor p are causally related in history H to the writes of processor q only if one 
processor simulates a tile that binds or dissociates, and there is a sequence of processors (WLOG from p 
to q) S = {p,pi,p2, ■ ■ ■ ,q) , such that each processor is a neighbor of its successor, and each processor chose 
to simulate its particular tile type because of information written to it by its predecessor. There is global 
agreement on the ordering of writes, because any processor q' that could view a write w hj p' , or view a 
write causally related to is a member of a sequence S" from p' to q' as above. So Ai satisfies GWO. 

On the other hand, M. does not satisfy either of the consistency conditions immediately stronger than 
GWO in Steinke and Nutt's lattice. These two stronger conditions are causal consistency (GPO+GWO), 
and GDO-l-GWO, where GDO is equivalent to cache consistency. M does not satisfy GDO for the same 
reason as in the aTAM simulation: there is no guarantee that writes performed to a given memory location 
will return the most recent value as a read. GPO, which is equivalent to PRAM consistency, was satisfied by 
the DSM models simulating the aTAM, but is not satisfied under this model because of the possibility that 
a register will return a value that has never been written to it. This is in fact a violation of slow consistency^ 
which requires that a read return a value that has previously been written to it. Slow consistency is strictly 
weaker than PRAM consistency, hence PRAM consistency cannot be satisfied. So GWO is the strongest 
consistency condition in the lattice obeyed by M. 

As in the previous section, there is a sense in which GWO is the "tight" memory consistency condition 
for kTAM simulation. The only consistency condition in Steinke and Nutt's lattice weaker than GWO is 
local consistency, which requires that each process's local operations appear to occur in the order specified 
by its program. Without such a requirement, processors in a simulation could potentially change the value 
of p before receiving the values written to them by their neighbors, which would be inconsistent with our 
intuition of simulating tiles binding to other tiles. However, the lack of consistency conditions between local 
consistency and GWO is due only to the fact that no one has defined and studied such conditions, not because 
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it is logically impossible to do so. More precise consistency conditions for simulation of error-permitting self- 
assembly may be an area of future investigation. 

4 Self-stabilization applied to tile self-assembly 

Now that we have reduced the kTAM to models of DSM that run deterministic algorithms with sometimes- 
faulty registers, we can apply the theory of self-stabilization to prove the existence of certain error-correcting 
tile assembly systems. Recall that a distributed system is self- stabilizing if, starting from any initial state, the 
system is guaranteed to converge to, and stay in thereafter, one of a set of "legitimate" states; this research 
topic was begun by Dijkstra [24]. In the case of tile assembly, the legitimate states are the assemblies 
achievable from error- free assembly sequences. 

While not in wide use in the distributed computing literature, there is a simple, polynomial-time conver- 
sion that, given a constant-time local algorithm, yields a constant-time self-stabilizing algorithm [25] [26] . 
(See [57] for a recent exposition of this conversion, with additional motivation and examples.) Molecular self- 
assembly algorithms are inherently local, and the self-assembly literature has considered two main classes 
of self-stabilizing algorithms: proofreading tilesets |28j . which correct initial binding errors; and self-healing 
tilesets [22], which rebuild completed assemblies that have been damaged. Soloveichik et. al. recently demon- 
strated a "proof of concept," by constructing a tile assembly system that combined both proofreading and 
self-healing properties within a restricted version of the kTAM |16j . Their construction worked for tile as- 
sembly systems that only built north and east. We can use tools from self-stabilization to generalize their 
result to the full kTAM, and to locally deterministic models that grow in any direction. 

If T is a tile assembly system, the c-scaled result ofT is the colored shape on the integer plane obtained 
by "blowing up" each location in the result of T to a c x c block of tiles, such that each tile in the block is 
colored the same as the tile on the source location in the result of T . 

Theorem 5. There is a polynomial-time algorithm that does the following: upon input of a locally deter- 
ministic tile assembly system T for the kTAM, it outputs a self-healing, proofreading, tile assembly system 
T such that T builds the c-scaled result ofT, for some constant c. Further, \T'\ < Q c^\T\'^ . 

Proof. Let T be a locally deterministic tile assembly system, and let {A4,P) be the DSM model that simulates 
it, as produced by the reduction in Theorem [4] Then let P* be the self-stabilizing algorithm obtained by 
applying the conversion of [3S] [2S] to P. (Briefly, P is a constant-time algorithm. To convert P to a self- 
stabilizing algorithm, "unroll" all possible executions of P, over all possible inputs, as a circuit. The program 
P* simulates that circuit, and assumes the inputs to the circuit are correct. Then it repeats that same step k 
times, where k is large enough such that, with high probability, all the inputs at round k really are correct.) 
P* is also a constant-time algorithm, whose running time depends on the size of the P-simulating circuit 
and the value of k we choose. 

As tile assembly is Turing universal, we can of course convert P* into tiles. More importantly, there 
is some constant c such that we need lay out only c-many tiles in order to simulate the behavior P* on 
any of its legal inputs. WLOG we assume that c is large enough, and our tile simulation is "padded" if 
necessary, so that the simulation of P*, on input of tile type t, takes up a c x c square, for any t. Further, 
we dedicate a set of tile types to each input t, so the color of each tile in this c x c square is the same color 
as the input tile t. Finally, in order to ensure that self-healing does not generate nondeterministic behavior, 
we have to differentiate each tile that could potentially add tiles in more than one direction. (This is why 
the construction in [16' assumed that tiles could only grow north and east.) In other words, if a "hole gets 
punched" in a completed assembly, self-healing tiles can rebuild the empty area, by building in the reverse 
direction from the original assembly sequence. So for each tile type in T, we need T' to include a unique tile 
type that encodes "Tile type t attached to the assembly using input sides 5." It takes at least one side to 
attach, so at most three sides of t remain as output sides, which is where we need to encode the information 
to reverse the process of binding t at that location. So the number of tiles we need to place into T' to simulate 
a given tile i of T is upper-bounded by (g) . Further, we need sufficient new tile types that encode "I have 
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received information from tile type i." There are at most another (g)]?^! of those. Since for each tile type in 
T there are at most tile types that simulate P* on i, we get the overall upper- bound \T'\ < (3) c2|r|2. 

A discussion of theoretical and practical reasons to choose particular values of k (i.e., the optimal number 
of iterations to minimize the possibility of error) for DNA self-assembly appears in [16 . For our purposes, 
it suffices to use the conversion from constant-time algorithm to self-stabilizing algorithm as a black box, 
without considering the specific types of error that are most likely to occur in the kTAM. Therefore, Theo- 
rem [5] gives much weaker tile complexity bounds than the dedicated construction that appears in [161 , but 
it provides a general method that can be extended to a variety of self-assembly models, not just DNA tiling, 
and not just two-dimensional surfaces. 

Theorem [5] may be a "proof of applicability" of self-stabilization techniques to self-assembly, but more is 
needed than "just" self-stabilization. Perhaps the most troublesome form of error in nanoscale self-assembly 
experiments occurs when a tile binds incorrectly, and then other tiles bind around it, preventing it from 
dissociating. To address this in the language of self-stabilization, one needs a. fault- containing self-stabilizing 
algorithm with minimal fault gap [30 , that is to say, a self-stabilizing algorithm in which the effects of a 
processor's failure are contained within that processor's local neighborhood, and, after recovering from a 
given fault, there is only a small time gap until the system can recover from a new fault. Fujibiyashi et al. 
have suggested extending the kTAM with special tile mechanisms that would achieve a one-time-step fault 
gap with high probability |31j . though they did not phrase their results in the language of self-stabilization. 
There is some evidence that handling fault- containment probabilistically instead of deterministically will 
reduce an algorithm's fault-gap [32] , but little is known about such tradeoffs, either in self-assembly or in 
general distributed systems. 

5 Conclusion 

In this paper, we reduced the Abstract Tile Assembly Model and the Kinetic Tile Assembly Model to systems 
of distributed shared memory with particular memory consistency conditions. We then applied the reductions 
to show that (1) local determinism is closely related to concurrent-write freedom in parallel programming, 
and (2) the theory of self-stabilization can be usefully applied to questions of error correction in self-assembly. 

We focused on the aTAM and the kTAM because they have been the most theoretically studied self- 
assembly models. However, both models are limited to the binding of DNA tiles to other DNA tiles, and as 
the recent nanofabrication survey [5] points out, the "greatest promise" of algorithmic DNA self-assembly 
"may lie in applications where DNA nanostructure templates have been used to assemble other inorganic 
components and functional groups." The IBM/Caltech microchip project is an example of this research 
direction. Therefore, we believe it is critical to develop a programming theory for "mixed-media" models of 
self-assembly (such models by-and-large do not yet exist), and that programming theory may be advanced 
by continuing the investigation begun in this paper. 

From the perspective of "pure theory," there has been initial work to classify shared read/ write vari- 
ables [33], much as Steinke and Nutt classified known memory consistency systems. It would be useful 
to explore further the weak consistency, like GWO, offered by registers that simulate the binding of self- 
assembling agents, whether in DNA or another medium. It would also be useful to explore what "more 
robust" registers (like the consensus objects in [T3]) could be built, to know how agents might cooperate to 
form more fault-tolerant structures. Lastly, we believe it would be productive to explore further the relation- 
ship between self-assembling structures and the placing of geometric constraints on local and self-stabilizing 
algorithms. 
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